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Abstract 

We give a simple way to generate an infinite number of solutions to Pell's equation — 
Dy^ = 1, requiring only basic matrix arithmetic and no knowledge of irrational numbers. We 
illustrate the method for D — 2,7 and 61. Connections to the Stern-Brocot tree and universal 
geometry are also discussed. 

1 Introduction 

For D a positive square-free integer, Pell's equation is 

- Dy^ ^ 1. 

This is perhaps the most important Diophantine equation. Its history goes back to the ancient 
Greeks, to Archimedes' Cattle Problem (see and [10]), to Brahmagupta and Bhaskara, to Fermat 
and Euler, and it was Lagrange who finally established the main fact: 

Theorem 1 Pell's equation has an infinite number of integer solutions. 

Proofs are found in many books and articles on number theory, see for example [T], [2], [7]. 
Arguments based on continued fractions for quadratic irrationals are common. The aim of this 
paper is to give a self-contained, simplified proof which only requires basic knowledge of matrix 
multiplication and integer arithmetic. So you need not know what is meant by an irrational number. 

We follow classical ideas of Lagrange and Gauss, and also insights due to Conway ^ and others. 
The use of L and R matrices is not new, see for example [S] and [5] , and the computation of the 
fundamental automorphism of a binary quadratic form as described in [31 Section 6.12] also avoids 
irrationalities — but our approach is simpler and requires a minimum of background knowledge. 

We show how the method works explicitly in a few special cases, including the famous historical 
example of D = 61, and mention a palindromic aspect of the solution which also sheds light on the 
negative Pell equation — Dy^ = — 1. While we generate an infinite number of integer solutions, 
we do not address the issue of whether or not all solutions are obtained by this method. 

At the end of the paper we briefly connect our solution to paths in the Stern-Brocot tree, and 
mention the connections with universal geometry. 
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We need just a few basic definitions. A quadratic form is an expression in variables x and y 
of the form 

Q (x, y) = ax^ + 2bxy + cy^ (1) 



where a,b and c are integers. We write Q = {a,b,c) for short. Note that we are following Gauss' 
convention in using an even coefficient of xy. For a (column) vector v = {x, y) define 

Q{v)^Q {X, y) = {X, y) ^) (^^ = v^Av. 



The symmetric matrix 

b 



A: 



b 



c 



is the matrix of the quadratic form Q. The number ac — b^ is the determinant of Q. 

The quadratic form Q is balanced if a > and c < 0. For example, the Pell quadratic form 

Qd {x, y)=x^ - Dy"^ 

for > is balanced since its matrix is 

2 Equivalence of forms 

If Q is a quadratic form with matrix A and M is an invertible integral 2x2 matrix whose inverse 
also has integer entries, then we may define a new quadratic form Q' by 

Q' (v) = Q (Mv) = {Mvf A {Mv) = (M^AM) v 

with matrix A' = AM. We say that Q and Q' are equivalent forms, and notice that they 
take on the same values, since if Q' {v) = n then Q (Mv) = n and conversely if Q (w) = m then 
Q' (M~^t«) = m. Since the assumptions on M imply that detM = ±1, we must have 

det A' = det A 

so equivalent forms have the same determinant. 

For example, the Pell quadratic form Qs = (1,0, —3) is equivalent to the quadratic form Q' = 
(—2, —3, —3) since 

1 ^ /l \ /l 0\ /-2 -3 



1 II \0 -3J \1 IJ \-3 -3 

The form takes the value 1 at (2, 1)^ , and Q' takes the same value 1 at (2, —1)^ . The relationship 
between these two solutions is 

'1 0\ / 2 \ /2 



J 1/ v-iy VI 

Note that while Qs is balanced, Q' is not. 
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In practice we will focus on two very simple equivalences, using the following matrices: 
For a symmetric matrix 



we compute 

'l fa b\ fl Q\ fa + 2b + c b + c 

L' AL = I 

and 

^ "'^^^^0 Ij \b cj \Q lj^\a + b a + 2b + c 
Going from A to L'^AL we call a left step. Going from A to AR we call a right step 



^ l) \b c [l 1 -[ b + c c I ^'^> 



R^AR-d ]) (: V (4) 



3 Solving Pell's equation 



Here is our strategy for generating an infinite number of solutions to Pell's equation x — Dy = 1. 

1. First observe that e ~ (1, 0)"^ is a solution. 

2. Perform a suitable (non-empty) sequence of left and right steps, beginning with the Pell 
quadratic form Qd — (1,0,— Z?) with matrix ([2|), and ending also with = (1,0, —-D). 
This yields a matrix equation of the form 

N'^AN = A (5) 

for some invertible matrix N with strictly positive entries, actually of the form 

\v u 

3. Then observe that Ne = {u,v)^ , the first column of N, is also a solution, since 

{Nef A {Ne) = e^ {n'^ AN) e = e^ Ae = 1. 

4. Iterate to generate an infinite family of solutions, namely e, Ne, N'^e, ■ ■ ■ . 

Let us now explain what the suitable sequence of left and right steps is, and why we can be sure 
to return to after a finite number of such steps. The key is simply to ensure that at each stage 
we obtain a balanced form. 

If at some stage we have a balanced form (a, b, c), we compute the total 

T = a + 2b + c. 
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If T > we perform a left step. If T < we perform a right step. Then ([3]) and Q ensure that the 
new form {a',b',cJ) is also balanced, and it is clearly equivalent to (a, 6, c). 

The case T = cannot arise if the determinant of (a, b, c) is —D, since the identity 

-D = ac - b'^ = {a + 2b + c) c - {b + cf 

would then imply that D is a square. 

To record the step at each stage, we write either 

(a, b,c)L{a + 2b + c,b + c, c) 

or 

(a, b, c) R {a, a + b, a + 2b + c) . 

Suppose we have taken a step from the balanced quadratic form (a, b, c) to the balanced quadratic 
form (a', 6', c') . Can we tell whether this step was left or right? In the case of a left step a' — 2b'+c' = 
a > while in the case of a right step a' — 2b' + c' = c < 0. So the total of the quadratic form 
(a', —b',c') determines whether the step was left or right. 

If we start with the quadratic form Qu = (1; 0, —D) with determinant ~D < 0, and arrive after 
a certain number of steps to the quadratic form (a, 6, c) then we must have a > 0, c < and 

ac-b'^ = -D. 

Since ac is negative, there are only a finite number of integers a, b and c that satisfy this equation. 
So we must eventually return to a form that we have encountered before, and since we are capable 
of finding the inverse of any step, we must return first to the initial form, namely Qd- 

At the beginning with Qd the total isT=l — _D<Oso the first step must be a right step. 
Further right steps increase the last entry of c), so before we return to Qu we must have 
taken at least one left step. So the cumulative matrix N, which is the product of the matrices R 
and L corresponding to the steps we have taken, has strictly positive entries. This ensures that the 
sequence of solutions e, A^e, N'^e, • • • is increasing in each component, and so really does represent 
an infinite family of solutions. 

then more generally ([5]) gives us the identity 

(ax + /3t/)^ — D (jx + Sy)^ ~ x"^ ~ Dy^ 

which allows us to generate new solutions to Q [x, y) — n from existing ones, for any integer n. It 
also allows us to deduce that N has the form 

^^fu Dv\ 
\v U J 



4 



4 Example of L> = 2 

The simplest example of Pell's equation is 

a;2 - = 1. 

The ancient Greeks knew how to generate solutions. Geometrically we are looking for integer points 
on a hyperbola, as shown. 
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The hyperbola — = 1 



We begin with the Peh quadratic form Q2 = (1, 0, -2) with a total ofT=l + 2xO-2:=-l<0 
so we take a right step to get 

(l,0,-2)i?(l,l,-l). 
This corresponds to the matrix equation 

(0 -2) ^ = (1 -1) ■ 
The total is now r = l + 2xl — l = 2>0sowe take a left step to get 

(l,0,-2)i?(l,l,-l)L(2,0,-l). 
The total is now T = 1 so we take another left step 

(1,0, -2)i?(l,l,-l)L (2,0, -1)L (1,-1,-1). 
The total is now T = — 2 < so we take a right step 

(l,0,-2)i?(l,l,-l)L(2,0,-l)L(l,-l,-l)i?(l,0,-2) (6) 
to return to our starting quadratic form Q2. This yields the matrix equation 



So 
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gives the non-trivial solution v = Ne = (3, 2) to Pell's equation, and by taking higher powers of 

N wc get further solutions (17, u f . (99, 70)^ , (577,408)'^ and so on. 
More generally the form of N establishes the identity 

(3a; + 4yf -2{2x + 3yf = - 2y^ 

which allows us to generate an infinite number of solutions to Q2 {x, y) = n from any initial solution. 

5 Example of D = 7 

The quadratic form = (1, 0, — 7) gives the stepping sequence 

(1, 0, -7) i? (1, 1, -6) i? (1, 2, -3) L (2, -1, -3) 

i? (2, 1, -3) L (1, -2, -3) i? (1, -1, -6) i? (1, 0, -7) . 

This corresponds to the matrix equation 

{R^LRLR'fi^^ ^^R'LRLR^={^^ . 

The first column of 

N = R^LRLR^ = 

gives the solution (8, 3)"^ , and higher powers give also (127, 48)'^ , (2024, 765)^ and so on. Further- 
more we have the identity 

(8a; + 2lyf - 7 (3a; + Syf = x^ - ly"^ 
which generates further solutions to any solution of x^ — ly^ = n. 

6 Palindromic aspect 

The palindromic nature of the sequences occurring for A'' in the above two examples is a general 
feature, and follows from the fact that the inverse of a left step is 

(a', b', c') (a' - 2b' + c', b' - c', c') 

while the inverse of a right step is 

{a', b', c') R-^ {a', b' - a', a' - 2b' + c') . 

So if we negate the middle b components, the sequence reads the same from the end as from the 
beginning. To determine when we have arrived at the half-way point we look for symmetry: two 
adjacent or almost adjacent (separated by one) quadratic forms which differ only in the sign of the 
middle term. 

The central symmetry allows us to identify when the negative Pell equation x"^ — Dy^ = — 1 is 
soluble, and give a positive solution for it, if such exists. This will happen when the central part of 
the sequence looks like 

{a',b',c')iD,0,-l){a',-b',c') 



3 S 
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In that case N can be written as = MM'^ with M of the form 

\Ul Vl J 

so that its second cohimn gives (ui, vi) satisfying — Dvf = — 1. 
For the D = 2 case, © shows that 

M=RL=(^^ ;) 

giving (ui, Vl) — (1, 1). The D = 7 case does not have this form, and indeed x'^ — 7y'^ = —1 has no 
solutions, as —1 is not a square mod 7. 

We win further exploit this palindromic phenomenon in our last example. 

7 Example D = 61 

The Pell's equation x'^ — 61y^ — 1 was recognized by Fermat as being particularly challenging, and 
he posed it as a problem to his contemporaries, which was subsequently solved by Euler. However 
according to [2] the eleventh century Indian mathematician Bhaskara solved this problem. We 
shorten the presentation by combining powers of L and R when convenient. The required sequence 
of left and right steps exhibits an additional doubly palindromic aspect, in that each half is itself 
palindromic in an opposite way, that is with L and R symbols interchanged: 

(1, 0, -61) i?^ (1, 7, -12) L (3, -5,-12) R'^ (3, 7, -4) (9, -5, -4) 
R (9, 4, -5) L (12, ~1,-5)L (5, -6, -5) i? (5, -1, -12) i? (5, 4, -9) 
L (4, -5, -9) i?3 (4, 7, -3) L"* (12, -5, -3) R (12, 7, -1) (61, 0,-1) 

(12, -7, -1) R (12, 5, -3) (4, -7, -3) R^ (4, 5, -9) L (5, -4, -9) 
R (5, 1,-12) R (5, 6, -5) L (12, 1, -5) L (9, -4, -5) R (9, 5, -4) 

(3, -7, -4) i?" (3, 5, -12) L (1, -7, -12) i?^ (1, 0, -61) 

Computation of the required matrix N is shortened by the doubly palindromic aspect: 

TV R^LR'^L^RL^R^LR^L^RL^L^RL^R^LR^L^RL^R'^LR' 

_ /453 164\ /453 58\ / 21 58 \ /21 164\ 
"1^ 58 21 j 1^164 21) \IM 453/ 1^58 453/ 

_ /1766 319 049 13 795 392 780\ 
" 1^ 226153 980 1766 319 049 ) 

and its first column gives the smallest non-trivial solution x = 1766 319 049 and y — 226153 980. 
The occurrence of (61,0, —1) in the middle of the sequence shows that the negative Pell equation 
has a solution, and indeed the second column of 

A53 164\ /453 58\ _ /232 105 29 718\ 
~ 58 21 ) 21) ~\29 718 3805 ) 

satisfies (29 718)^ - 61 (3805)^ = -1. 
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8 Speeding up the algorithm 



The previous example suggests that for large D, it is more efficient to work out how many successive 
R or L steps are needed at each stage. Since 

- (; ") - (: ;) 

it is easy to compute that for arbitrary natural numbers n and m, 

(a, 6, c) i?" = (a, + an, c + 2bn + ari^) 
(a, 6, c) = (a + 26m + cm^, 6 + cm, c) . 

So if a right step i? is needed, namely ii T = a + 2b + c < 0, then we find the smallest k so that 
c + 26fc + ak^ > 0, and choose n = k — 1. If a left step L is needed, namely if T = a + 26 + c > 0, 
then we find the smallest / so that a + 2bl + cP < 0, and choose m = Z — 1. 

So for example at the first stage of the D = 61 case, (1, 0, —61) has a total of T = —61 < and 
since (—61) + fc^ > first when fc = 8, we set n = 8 — 1 = 7, giving 

(l,0,-61)i?^ = (1,7,-12). 

Now we know the next step must be L™ for some m, and we compute that 1 + 2x7/ — 12P < 
first when / = 2, so that m = 2 — 1 = 1, giving 

(1, 0, -61) i?^ (1, 7, -12) L = (3, -5,-12) 

and so on. 



9 Connections with the Stern-Brocot tree 

The Stern-Brocot tree (see 6 for a thorough discussion) is generated from the two 'fractions' 0/1 
and 1/0 by repeatedly performing the mediant operation. It naturally associates binary sequences 
in L and R with strictly positive reduced fractions, as shown for the correspondence 



RL^R^ -. 

5 




/\ /\ /\ /\ /\ /\ /\ /\ /\ /\ /\ /\ /\ /\ /\ /\ 

1 2 3 34 55457 8J 787169ni0ni3 12 99 12 131110 n9 6 
6 9 11 10 11 13 12 9 9 12 13 11 10 11 9 6 5 7 8 7 7 8 7 5 4 5 5 4 3 3 2 1 

The Stern Brocot tree 
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The Stern-Brocot tree is closely related to the Euclidean algorithm and the theory of continued 
fractions, and there is a useful alternate form which contains 2x2 matrices instead of fractions. 
The matrix corresponding to a fraction a/h has columns which are the two immediate left and right 
predecessors of a/6 in the original tree, and whose columns sum to give the fraction, regarded as a 
vector. A binary sequence of L's and i?'s can be directly translated to a matrix by interpreting L 
and R to be the two matrices we used earlier, and simply performing the matrix multiplication. 

So computing N in the Pell's equation solution amounts to following a path in the matrix form 
of the Stern-Brocot tree. Each step involves adding one of the existing columns to the other, either 
to the left or to the right. Shown is the case 



N = RL^L 



3 4 
2 3 



which occurred in the solution of — 2y^ = 1. 
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Matrix form of the Stern-Brocot tree 



10 Geometric interpretation 

We have been able to avoid irrationalities because numbers of the form a + b^/T) behave arithmeti- 
cally exactly as do matrices of the form 

fa Db 
\b a 

This will be familiar to most readers in the imaginary case of 13 = — 1 , but it holds more generally, 
see for example [S^. It allows us to consider a quadratic extension of the rationals not only as 
a subfield of the complex numbers, but also as a field of rotation/dilation operators on a two- 
dimensional (non-Euclidean) geometry purely over the rational numbers. 

The usual approach to metrical geometry is to consider distance and angle as the fundamental 
concepts, but this is neither in the original spirit of the ancient Greeks, nor an approach that 
lends itself easily to generalization. Taking the quadrance -\-y^ as the fundamental measurement, 
instead of its square root, together with an algebraic replacement of angle called the spread between 
two lines, allows the development of a Euclidean theory of geometry that extends both to general 
fields, and also to arbitrary quadratic forms. In particular there is a very rich theory of Euclidean 
geometry over the rational numbers! 
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This new universal geometry is described in and |12| . Our discussion of Pell's equation thus 
has a direct geometric interpretation (actually this is the context in which it was discovered). So 
for example the matrix 

21' 

.3 



N = 



in the case of D = 7 amounts to a rotation in the geometry for which quadrance is defined by 
Q {x, y)=x'^ - 7y. 

Finally we point out that a quadratic irrational number such as V2 manifests itself as a periodic 
infinite path down the Stcrn-Brocot tree. A more general irrational number can be defined as a more 
general infinite path down the Stern-Brocot tree. This approach to the definition of irrationalities 
has the significant advantages that it involves no ambiguities related to equivalences, and that the 
initial paths of an irrational give automatically the best rational approximations to it. 
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